Introducing the Gab Hate Corpus: defining and applying hate-based rhetoric to social media posts at scale

نویسندگان

چکیده

We present the Gab Hate Corpus (GHC), consisting of 27,665 posts from social network service gab.com, each annotated for presence “hate-based rhetoric” by a minimum three annotators. Posts were labeled according to coding typology derived synthesis hate speech definitions across legal precedent, previous typologies, and psychology sociology, comprising hierarchical labels indicating dehumanizing violent as well indicators targeted groups rhetorical framing. provide inter-annotator agreement statistics perform classification analysis in order validate corpus establish performance baselines. The GHC complements existing datasets its theoretical grounding providing large, representative sample richly media posts.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detecting the Hate Code on Social Media

Social media has become an indispensable part of the everyday lives of millions of people around the world. It provides a platform for expressing opinions and beliefs, communicated to a massive audience. However, this ease with which people can express themselves has also allowed for the large scale spread of propaganda and hate speech. To prevent violating the abuse policies of social media pl...

متن کامل

Detecting Hate Speech in Social Media

In this paper we examine methods to detect hate speech in social media, while distinguishing this from general profanity. We aim to establish lexical baselines for this task by applying supervised classification methods using a recently released dataset annotated for this purpose. As features, our system uses character n-grams, word n-grams and word skip-grams. We obtain results of 78% accuracy...

متن کامل

Analyzing the Targets of Hate in Online Social Media

Social media systems allow Internet users a congenial platform to freely express their thoughts and opinions. Although this property represents incredible and unique communication opportunities, it also brings along important challenges. Online hate speech is an archetypal example of such challenges. Despite its magnitude and scale, there is a significant gap in understanding the nature of hate...

متن کامل

Surfacing contextual hate speech words within social media

Social media platforms have recently seen an increase in the occurrence of hate speech discourse which has led to calls for improved detection methods. Most of these rely on annotated data, keywords, and a classification technique. While this approach provides good coverage, it can fall short when dealing with new terms produced by online extremist communities which act as original sources of w...

متن کامل

Hate Me, Hate Me Not: Hate Speech Detection on Facebook

While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical viol...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Language Resources and Evaluation

سال: 2022

ISSN: ['1574-020X', '1574-0218']

DOI: https://doi.org/10.1007/s10579-021-09569-x